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Abstract 

We study the convergence rate of randomly truncated stochastic algorithms, which 
consist in the truncation of the standard Robbins-Monro procedure on an increasing se- 
quence of compact sets. Such a truncation is often required in practice to ensure conver- 
gence when standard algorithms fail because the expected- value function grows too fast. 
In this work, we give a self contained proof of a central limit theorem for this algorithm 
under local assumptions on the expected-value function, which are fairly easy to check in 
practice. 

Key words: stochastic approximation, central limit theorem, randomly truncated 
stochastic algorithms, martingale arrays. 

1 Introduction 

The use of stochastic algorithms is widespread for solving stochastic optimization prob- 
lems. These algorithms are extremely valuable for a practical use and particularly well suited 
to localize the zero of a function u. Such algorithms go back to the pioneering work of 
Robbins and Monro (1951), who considered the sequence 

X n+ i = X n - -y n+ iu(X n ) - >y n+1 5M n+1 (1) 

to estimate the zero of the function u. The sequence (j n )n classically denotes the gain or 
step sequence of the algorithm and {5M n ) n depicts a random measurement error. Neverthe- 
less, the assumptions required to ensure the convergence — basically, a sub- linear growth 
of u on average — are barely satisfied in practice, which dramatically reduces the range of 
applications. Chen and Zhu (1986) proposed a modified algorithm to deal with fast growing 
functions. Their new algorithm can be summed up as 

X n+ \ =X n - -y n+ iu(X n ) - 7„ + i(5M n+ i + j n+ ip n+1 (2) 

where (p n ) n is a truncation term ensuring that the sequence (X n ) n cannot jump too far ahead 
in one step. 

In this paper, we are concerned with the rate of convergence of Equation (2). Numerous 
results are known for the sequence defined by Equation (1), which is known to converge at 



*Laboratoire Jean Kuntzmann, Universite de Grenoble et CNRS, BP 53, 38041 Grenoble Cedex 9, 
FRANCE, e-mail : jerome.lelong@imag.fr 



1 



2 



Randomly truncated stochastic algorithms 



the rate ^/7Vi when j n is of the form ^ with 1/2 < a < 1 (see Delyon (1996), Duflo (1997) 
or Buche and Kushner (2001) for instance). When jn = ^ and ^he Hessian matrix at the 
optimum is of the form XI, Duflo (1997) showed that the convergence rate depends on the 
relative position of A and ^. A functional central limit theorem for this algorithm was proved 
by Bouton (1985) and Benveniste et al. (1990). The convergence rate of constrained algo- 
rithms was studied by Kushner and Yin (2003). The problem of multiple targets was tackled 
by Pelletier (1998) who proved a Central Limit Theorem. However, very few results are 
known about the convergence rate of the algorithm devised by Chen and Zhu (1986). Chen 
(2002) briefly studied the convergence rate under global hypotheses on the noise sequence 
(5M n ) n . Here, we aim at giving a clarified, self-contained and elementary proof of this re- 
sult under local assumptions (see Section 2.3 for a detailed comparison of the two results). 
Besides giving a clarified and self-contained proof of the central limit theorem for randomly 
truncated algorithm, the improvement brought by our work is the use of the local condi- 
tion sup n E[|<5M n | 2+p l{|x„_i-x*|<»;}] < 00 with some p > and 7] > replacing the global 
condition sup n E[\5M n \ 2+p ] < oo. 

First, we define the general framework and explain the algorithm developed by 
Chen and Zhu (1986). Our main results are stated in Theorems 2.1 and 2.2 (see page 4) 
depending on the decreasing speed of the sequence (-y n )n- In Section 2.3, we discuss the im- 
provements brought by our new results and we give a concrete example to show the benefits 
of using local assumptions. Section 3 is devoted to the proof of the main results. 



2 A CLT for randomly truncated stochastic algorithms 

It is quite common to look for the root of a continuous function u: x E M. d i — > u(x) G M d , 
which is not easily tractable. We assume that we can only access u up to a measurement error 
embodied in the following by the sequence (SM n ) n and that the norm |it(x)| 2 grows faster 
than \x\ 2 such that the standard Robbins-Monro algorithm (see Equation (1)) quickly fails. 
Instead, we consider the alternative procedure introduced by Chen and Zhu (1986). This 
technique consists in forcing the algorithm to remain in an increasing sequence of compact 
sets (Kj)j such that 

oo 

|J Kj = R d and Vj, K s £ int(K j+1 ) . 

j=0 

It prevents the algorithm from blowing up during the first iterates. Let {"f n )n be a decreasing 
sequence of positive real numbers satisfying J2nln = oo and J2nln < 00 • F° r x o £ ^ d and 
(To = 0, we define the sequences of random variables (X n ) n and (a n ) n by 

{ X n+\ = X n - 7n+lu(X n ) - J n+1 SM n+ l, 
if X n+\ G K <rn X n+l = X n+\ and a n+l = <?n, (3) 
if x n+ l i K <?n x n+i = X and a n+1 =a n + l. 

Let T n denote the a— algebra generated by (SMk,k < n), F n = a{5Mk,k < n). We assume 
that {5M n ) n is a sequence of martingale increments, i.e. E(5M n +x|J r ra ) = 0. 

Remark 1. X.i is actually drawn from the dynamics of the Robbins-Monro algorithm (see 
Equation (1)). If the standard algorithm wants to jump too far ahead it is reset to a fixed 
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value. When X n ,i ^ K Un , one can set X n+ \ to any measurable function of (Xq, . . . , X n ) 
with values in a given compact set. The existence of such a compact set is crucial to prove 
the a.s. convergence of (X n ) n . 

It is more convenient to rewrite Equation (3) as follows 

X n+ i = X n - 7 n+ iti(X n ) - j n+1 5M n+1 + 7„ + ip n+ i (4) 

where 

p n+1 = ( u{X n ) + 5M n+l + (X -X n ))lf v 

In this paper, we only consider gain sequences of the type 7„ = T^jjna ; with 1/2 < a < 1. If 
a = 1, we obtain a slightly different limit. For values of a outside this range, the almost sure 
convergence is not even guarantied. 

2.1 Hypotheses 

In the following, the prime notation stands for the transpose operator. We introduce the 
following hypotheses. 

(Al) i. 3x* E R d s.t. u(x*) = and Vx G R d , x / a*, (a - x*) ■ u(x) > 0. 

ii. There exist a function y : W d — > M. dxd satisfying limi^i^o |y (a;) | = and a sym- 
metric positive definite matrix A such that 

u{x) = A{x — x*) + y(x — x*)(x — x*). 

(A2) For any q > 0, the series J2n ln+i5M n+ il{\ Xn -x*\<q} converges almost surely. 
(A3) i. There exist two real numbers p > and rj > such that 

k = sup IE ( \8M n \ 2+p l{|X„_ 1 -x*|<r,}) < oo. 
ii. There exists a symmetric positive definite matrix £ such that 
E (5M n 5M' n \F n ^) lflx,^-^,,} S. 

(A4) There exists \x > such that Vn > 0, d(x*, dK n ) > p. 
Remark 2. Comments on the assumptions. 

1. Hypothesis (Al-i) is satisfied as soon as u can be interpreted as the gradient of a strictly 
convex function. The Hypothesis (Al-ii) is equivalent to saying that u is differentiable 
at x* . 

2. Hypothesis (A2) ensures that X n — > x* a.s. and a n is almost surely finite, see Lelong 
(2009) for a proof of this result. 

3. Hypothesis (A3-i) corresponds to some local uniform integrability condition and re- 
minds of Lindeberg's condition. ( A3-ii) guaranties the convergence of the angle bracket 
of the martingale of interest. 

4. Hypothesis (A4) is only required for technical reasons but one does not need to be con- 
cerned with it in practical applications. It reminds of the case of constrained stochastic 
algorithms for which the CLT can only be proved for non saturated constraints. 
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2.2 Main results 

For n > 0, we define the renormalized and centered error 

A n = n . 

A CLT for 1/2 < a < 1 

Theorem 2.1. If we assume Hypotheses (Al) to (A4), the sequence (A n ) converges in 
distribution to a normal random variable with mean and covariance 



POO 

V = I exp (-At)E exp (-At)dt. 
Jo 



A CLT for a = 1 

Theorem 2.2. We assume Hypotheses (Al) to (A4) and 

(A 5) 7 A — ^1 is positive definite. 

Then, the sequence (A n ) converges in distribution to a normal random variable with mean 
and covariance 

V = 7 J™ exp ( - jAj t\ S exp (Y- - (it. 

Remark 3. Hypothesis (A5) involves the gradient of function u at the point x*, which is 
seldom tractable from a practical point of view but one can definitely not avoid it. The 
positivity of "/A — hi is the border of two different convergence regimes as already noted by 
Duflo (1997) for the Robbins-Monro algorithm. 



2.3 Discussion around the assumptions of Theorem 2.2 

Theorem 2.2 is actually an extension of Chen (2002, Theorem 3.3.1). The main improve- 
ments brought by our new result concern the conditions imposed on the noise term. Our 
Assumption (A3) is weaker than the one imposed by Chen (2002, A3. 3. 3) since we only as- 
sume local conditions on the noise terms; namely, unlike Chen, we only need to monitor the 
behavior of (5M n ) n in a small neighborhood of the optimum x* (see Assumptions (A3-i) and 
(A3-ii)). Moreover, we only assume the local convergence in probability of the angle bracket of 
the martingale of interest built with (8M n ) n whereas Chen (2002, Equation (3.3.22)) requires 
the almost sure convergence which may be a little harder to prove in practical applications. 

Concerning Assumption (Al-ii), it essentially means that u must be differentiable at x*. 
This is to be compared to the Holder continuity property of the remainder of the first order 
expansion of u at x* required by Chen (2002, A3. 3. 4), which is not so obvious to check in 
practice. Our goal in this work was not only to state a theorem with weaker assumptions 
but also to present a self contained and elementary proof of a central limit theorem for 
truncated stochastic algorithms. In particular, Lemma 3.1 provides a smart way of handling 
the truncation terms. 

Let us us consider an example which often arises in practice (see for instance Arouna 
(Winter 2003/2004) and Lelong (2007)). Assume the function u is defined as an expectation 
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u(x) = E(U(x,Z)) where Z is a random vector, then we can for instance take 5M n+ i = 
U(X n , Z n+ \) — u{X n ) with (Z n ) n an i.i.d. sequence of random vectors following the law of 
Z. With this choice and if we further assume that for all q > 0, sup| a ,| <(? E(|[/(x, Z)\ 2JrP ) < oo 
and that the function x i — > E([/(x, Z)U(x, Z)') is continuous at x*, then it is obvious that 
Assumptions (A2) and (A3) are satisfied. Note that in this particular but widely used setting, 
there is no assumption to be checked along the paths of the algorithm as it was the case in 
the results of Chen. This considerably widens the range of applications as the assumptions 
of our theorems boils down to basic regularity properties of the function U. Note also that 
we do not impose any condition on the behaviour of the sequence (p n ) n , i-e. on the choice of 
the compact subsets {K n ) n . 



3 Proofs of Theorems 2.1 and 2.2 

In this section, we prove the Theorems presented in Section 2.2 through a series of three 
lemmas. The proofs of these lemmas are postponed to Section 3.2. 



3.1 Technical lemmas 



For any fixed n > 0, we introduce s U: k = J2i=o ln+i for /c > and we set s n fl = 0. 
(s n ,k)k>o can be interpreted as a discretisation grid of [0, oo) because lim^oo s n ^ = oo. 
Theorems 2.1 and 2.2 are based on the following three lemmas. 

Lemma 3.1. Let e > and r/ > as in Hypothesis (A3). There exists No > 0, such that if 
we define for n > Nq 



sup \X m - x*\ < 7] 

n>m>No 



then 



P(A»)>l-e Vn>iV and sup E | A n p lr A x < oo. 

n>AT V ' 



Lemma 3.2. For any integers t > and n > 

t-l 



t-i 



A 

where 



= e- s -< Q A n - £ e O("».*-».*) ^^l5M n+k+l - £ e 

fc=0 fc=0 



7n+fc-Rn+fc,(5) 



if a = \, 



Q 



A-^-I 



-y(X m - x*)A m + -j=L= p m+1 
+7m(a m I + b m {A + y{X m - x*)) + 0{j m ))A r 



(6) 



if 1/2 < a < 1, 



A 



y(X m - x*))A m - — L=p m+1 

^-{a m I + b m j n (A + y(X m - x*)))A m + 0(j m )A r 



(7) 



with {a n ) n and (b n ) n two real valued and bounded sequences. 
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Moreover, the last term in (5) tends to zero in probability. 

Lemma 3.3. In Equation (5), the sequence (J2k=o e ( ^ <ySn ' k ~ Sn ' t ^ ^/7n+fc^-^n+fc)t converges 
in distribution to M(0, V n ) for any fixed n when p goes to infinity, where V n = 

V?° -V , > p~Q s n,k Y, P -Qs„.k 
Z^fc^/n+fcc ' Z^e 

Proof of Theorems 2.1 and 2.2. Let us consider Equation (5) for a fixed n > Nq, where Nq 
is denned in Lemma 3.1. Because the matrix Q is definite positive and A n is almost surely 
finite, e~ SnJ QA n tends to zero almost surely when t goes to infinity. Thanks to Lemma 3.2, 
the last term in Equation (5) tends to zero in probability when t goes to infinity. 

Combining these two convergences in probability to zero with Lemma 3.3 yields the 
convergence in distribution of (A n+ t)t to a normal random variable with mean and variance 
V when p goes to infinity, where V is defined in Lemma 3.3. Plugging the value of the matrix 
Q (see Equations (6) and (7)) in the expression of V yields the result. ■ 

Note that the proof for the classical Robbins Monro algorithm is much simpler since we 
do not need to introduce the A n sets, which are only used here to handle the truncation 
terms. 



3.2 Proofs of the lemmas 

3.2.1 Proof of Lemma 3.1 

We only do the proof in the case a = 1, as in the other case, it is sufficient to slightly 
modify a few Taylor expansions and the same results still hold. From Equation (4), we have 
the following recursive relation 



A n+1 = Xn+l = J—^- A n - J^ n+1 {u{X n ) + 5M n+1 - p n+ i). 

y/ln+l V 7n+l 

Using Hypothesis (Al-ii), the previous equation becomes 



n+iln(A + y(X n - x*))j A n - y/j n+ i5M n+1 + y'Tn+iPn+i- (8) 
The following Taylor expansions hold 

^I = l + ^ + ( 7 2) and V7^TT = 7n + 0(7n)- (9) 
There exist two real valued and bounded sequences (a n ) n and (b n ) n such that 



In i , In 



V 7n+l 2 7 

This enables us to simplify Equation (8) 



1 + 77- + 1n a n and V7n7n+1 = In + J n b n . 



A n+ i =A n - 7„QA n - -f n y(X n - x*)A n - y/j n+1 5M n+ i 

+ ^rip n+ i + jl{a n I + b n {A + y{X n - x*)))A n , (10) 
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where Q = A — Let A n+ i 



X 



2 V^+i 

value of the new iterate obtained before truncation. 



where X „ + 1, defined by Equation (3), is the 



: |A n - 7nQA n - 7„y(X n - x*)A n - v /7 n+ i<5M n+ i 

+ 7 2(a„I + b n (A + y(X n - x*)J))A, 
<|A n - 7„QA n - -f n y{X n - x*)A„| 2 + 7 n+ i|(5M n+ i| 2 
+ 7 ^|(a n 7 + b n (A + y(X n - ^)/))A„| 2 
+ 2 7n (A n - 7n QA n - j n y(X n - x*)A n )'dM n+1 

+ 2^{{a n I + b n (A + y(X n - x*)I))A n )'(A n - 7„QA n - j n y(X n - x*)A n ) 
+ 2^J 2 5M' n+l {a n I + b n (A + y(X n - x*)I))AM n . 

If we take the conditional expectation with respect to J- n — denoted E n — in the previous 
equality, we find 



En 



<\A n - jnQA n - j n y(X n - x*)A n \ 2 + ~f n+1 E n \5M n+1 \ 2 

+ 7^ ||a n 7 + b n (A + y(X n - x*)I)f \A n \ 2 

+ 2 7 * || (a„/ + 6 n (A + y(X n - x*)I))\\(l + 7 n ||Q + y(X n 



IA I 2 

^n • 



En 



< \A n \ 2 -2 ln A n '(Q + y(X n - x*))A n + 7n+1 E n |5M n+1 | 
+ o( 7 ^(l + |y(X n -^)|)|A n | 2 . 



(11) 



Note that in the previous equation the quantity O (7 2 ) is non random. 

Let A > be the smallest eigenvalue of Q, which is symmetric definite positive. Since 
]im.\ x \_ ¥0 y(x) = 0, there exists rj > such that for all \x\ < rj, \y(x)\ < A/2. We assume that 
this value of rj satisfies Hypothesis (A3). Let e > 0. Since (X n ) n converges almost surely to 
x*, there exists a rank Nq such that 

P( SUp \X m - X*\ >?])<£. 

m>N 

Hence, F(A n ) > 1 - e for all n > Nq. 

On the set A n , Q + y{X n — x*) is a positive definite matrix with smallest eigenvalue 
greater than A/2. Therefore A n '(Q + y(X n — x*))A n > A/2 |A n | 2 . Hence, we can deduce from 
Equation (11) that 



E 



{An} 



E (|A n | 2 l {An} ) < - 7n AE (| A n | 2 l {An} ) + j n K 



+ 0( 7 ^)(1 + -A)E |A„ri {An} 



We can assume that for n > Nq, |C(7 2 )(1 + A/2) | < 7„A/2. Hence we get, for n > Nq, 



E 



A. 



E ( \A n \' l {An} ) < - j n -E { \A n \ z l {An} \ + ln K 
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Since A n+ i C A r 



E 



A 



n+i 



) ~ E (| A„r 1{A„}J < "7n (|A n | 2 l {An} j + K 7n , (12) 
Now, we would like to replace A n+ i by A n+ i in Equation (12). 



IA 



\X n - x* 



n+l\ 



*n+l\ 



7n+l 



-l{ Pn+1 ^o} + 



l {p n +i=0}, 



K+ k 



2 X - X* 
+ - 

In 



Taking the conditional expectation w.r.t. T n on the set A n gives 



E n |A n+1 | 2 < E n 



E n |A„ +1 | 2 l {An} < E, 



2 \Xq - X* 



A 



n+i 



7n+l 



X n+i £K an \T n ), 



2 \Xq - X* 



E(|A n 



+l| 



< E 



A 



n+i 



ln+1 

\X a - x* 



+ 



ln+l 



A n n{X n+h tK a J).(13) 



The probability on the right hand side can be rewritten 



A n n {X n+ i <£ K an }j - E [l{j n+1 | M (X„)+«5A/ n+ i|>d(A n ,0i^ CTn )} 1 {A n } 

Moreover using the triangle inequality, we have d(X n , dK an ) > d(x*, dK an ) — \X n — x*\. Due 
to Hypothesis (A4), d (x* , dK an ) > \i and on A n , \X n — x*\ < rj. Hence, d(X n ,dK an ) > 
[i — i]. One can choose rj < fx/2 for instance, so that d(X n , dK an ) > ^. 

A n n {X n+h i K a j) < E (E n (l {7n+i HXn)+SMn+im ) l {An} ) , 

< *&± E (\u(X n )\ 2 l {An} + \5M n+1 \ 2 )l {An} ) . (14) 

Thanks to Hypothesis (A3) and the continuity of u, the expectation on the r.h.s of (14) is 
bounded by a constant c > independent of n. So, we get 



Hence, from Equation (13) we can deduce 



E |A n+1 ri {An+l} <E 



A, 



(15) 



By combining Equations (15) and (12), we come up with 



E ( |A n+ i| 1{a„ + i} ) < l-7n7T E |A n ri {An} +c 7n , 



where c = c + k. 
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Let X = {t > JV : -|l (|A;| 2 1 {A . } ) + c> o}, then 



supE (|Aj| 



2c 

Ha*} ) < — < oo 



A 



Note that we can always assume that 2c/ X > E (JAjv | }J> such that the set I is non 

empty. Assume i ^ X, let io = sup{/c < i : A; € X}. 



E (|A,| 2 - E (|AJ 2 l {Aio} ) <Y^jJ c -^E (|A fc | 2 l {Ak} 

k=io 

Since all the terms for k = iq + 1, . . . ,i — 1 are negative and io 6 X, we find 

2c 



E(|A,| 2 l {yli} ) < 7i0 c+-. 



A 

Finally, we come with the following upper bound. 

sup E ( |A n | 2 1/4 i J < oo. 

n>N V 7 

Remark 4 (case 1/2 < a < 1). This proof is still valid for a < 1 if we replace the Taylor 
expansions of Equation (9) by 



In . a n 7„6 n 

" H and ^jnln+i = 7 n + 



y 7n+i n n 

Then, Equation (10) becomes 

A n+ i =A n - -y n QA n - -y n y{X n - x*)A n - ^J^ n+1 5M n+1 

+ J^ n+ ip n+ i + -(a n J + b n j n (A + y(X n - x*))A n , 
n 

with Q = A this time, which is still positive definite. 

3.2.2 Proof of Lemma 3.2 

Let us go back to Equation (10). For any n > Nq and k > 0, we can write 

A n+ A: =A n+fc _x - 7 n+fe _ 1 <5A n+fc _ 1 — ^/7 n+ fei5M n+ fe + ^ n+ k-iR n +k-\ 

where 

_ 1 

Rm = -y{X m - x*)A m + p m+ i + j m (a m I + b m (A + y(X m - x )))A m , 

V7m+1 

We can actually notice that the previous equation pretty much looks like a discrete time ODE. 
Based on this remark, it is natural to multiply the previous equation by e Sn , k Q to find 

e s n , k Q An+k _ ( e s„, fe Q _ e s n , k Q ln+k _ x Q) A n+jfc _! = - e s ^ kQ ^-^5M n+k + 7 n+fc _! e s ^ kQ R n+k -i 
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Note that e Sn ' k ® — e Sn > k ® 7 n +fc_iQ = e Sn ^- l( ^(l + 0(7^, fe _ 1 )). Hence, we come up with the 
following equation 

e s n , k Q An+k _ e s n , k -!Q A n+fc _! = - e s "- fcQ ^^SM n+k + 7n+fc _i e s "' kQ R n+k -i 

where 

R m = -y(X m - x*)A m + 1 p m+ i + 7 m (a m J + 6 m (^ + y{X m - x*)) + 0(l))A m , (16) 

When summing the previous equalities for = 1, . . . , t — 1 for any integer t > 0, we get 

t-l t-i 
A n+t = e~ s ^ A n - J2 e^- s ^)Q ^^5M n+k+1 - £ e ^ k ~ s ^ j n+ kRn + k, 

k=0 k=0 

Let us a have a closer look at the different terms of Equation (16) 

• lim m y(X m — x*)A m l^x m -x*\>r)} = a - s - thanks to the a.s. convergence of (X m ) m and 
using Lemma 3.1, the sequence (y(X m — x*)A m l^ Xm -x*\<rj})m is uniformly integrable 
and tends to zero in probability because lim m y(X m — x*) = a.s. 

• p m is almost surely equal to for m large enough thanks to Remark 2, so — ==p m = 
a.s. for m large enough. 

• 1m(a m I + b m (A + y(X m - x*)) + C(l))A m l|| Xm _ x .*| >r;} — > almost surely because 
for m large enough the indicator equals 0. The sequence j m (a m I + b m (A + y(X m — 
x*)) + 0(l))A m l^ Xm ~x*\<ri} is uniformly integrable by Lemma 3.1 and tends to zero 
in probability because 7 m — > 0. 

Hence, R m can be split in two terms : one tending to zero almost surely and an other one which 
is uniformly integrable and tends to zero in probability. Then, we can apply Propositions A.l 
and A. 2 to prove the convergence in probability of {Y! k 2 ^ k ~ s ^ t)Q In+kR +k)t- This last 
point ends the proof of Lemma 3.2. 



3.2.3 Proof of Lemma 3.3 

To prove Lemma 3.3, we need a result on the rate of convergence of martingale arrays. 
First, note that for — 1= 6M n l{\x n ^ 1 —x*\} tends to a.s. when n goes to infinity because 
l{|x n _i-z*|} = f° r n large enough. Then, it ensues from Proposition A.l that 
£fc=o eQ(Sn,fc ~ Sn ' i) v /7n+fc^ M n+fc 1 {|x n+fc _ 1 -x-|>r ? } converges to zero in probability when t goes 
to infinity. Henceforth, it is sufficient to prove a localized version of Lemma 3.3 by 
considering J2l=o e Q ^ k ~ s "^ y /r^ k '5M n+k l { \ Xn+k _ i _ x ,^ v} . 

We will use the following Central Limit Theorem for martingale arrays adapted from 
Duflo (1997, Theorem 2.1.9). 

Theorem 3.4. Suppose that {(J r i)o<l<t',t > 0} is a family of titrations and {(A/)o<Kt; t > 0} 
a square integrable martingale array with respect to the previous filtration. Assume that : 

+ IP 

(A 6) there exists a symmetric positive definite matrix T such that (N) t > F. 

t— >oc 
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(A7) There exists p > such that 



E E (k/- iv *-ir + H JC ?-i 

1=1 v 



t— >co 



-> 0. 



at* -A+A/-(o,r). 

t— >oo 



Using this theorem, we can now prove Lemma 3.3. 



Proof of Lemma 3.3. For the sake of clearness, we will do the proof considering that Q is 
a non-negative real constant instead of a positive definite matrix. Let us define Nf for all 
< I < t and t > 



N t = £ e (s n , k -s n , t )Q ^— mn+kl ^ x ^_ x ^ 
k=l 

(Nf)o<i< p is obviously a martingale with respect to (.F n+ j)j. Let us compute its angle bracket 



(N)l = ^e 2 ^^-^)«7n+ fe IE(5M n 2 +fc l { | Xri+fc _ 1 _ :c *|< 7?} |J n+fc _ 1 



(17) 



k=l 



Thanks to Hypotheses (A3), the conditional expectation in (17) is uniformly integrable and 
converges in probability to £ when k goes to infinity. Applying Proposition A. 2 proves the con- 
vergence in probability of (N)l to lim^oo YX=i e 2( - Sn > k ~ Sn '^ Q 7 n+ fcX = J2k=i e~ 2s "> fcQ 7 n +fc£. 
Let p be the real number defined in Theorem 2.1. 



i=i 



3 (2+p)(s n , fc -s„, t )Q ~, 1+ 2T 



fc=l 



7^ + fc converges to when A: goes to infinity and the sequence of expectations is bounded 

using Hypothesis (A3), so 7^ +fe E ( |<5M n+ fe| +p ^-{\x n+k _ 1 -x*\<-q\ ) tends to zero when goes 
to infinity. Proposition A.l proves that the l.h.s. of Equation (18) tends to when t goes 

to infinity. Hence, £i=i E (\Nf - Nf^ +P tends to zero in L , and consequently in 

probability. Then, the Hypotheses of Theorem 3.4 are satisfied. 
Finally, we have proved that 



fc=0 



t— >oo 1 i — ' 



k=l 
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4 Conclusion 

In this work, we have proved a Central Limit Theorem with rate y / 7^ for randomly trun- 
cated stochastic algorithms under local assumptions. We have also tried to clarify the proof of 
the convergence rate of randomly truncated stochastic algorithms under assumptions which 
can be easily verified in practice. The improvement brought by this new set of assumptions is 
that all they should only be checked in a neighbourhood of the target value x*, which means 
that in the case where u(x) = K(U(x, Z)) the assumptions can be reformulated in terms of 
some local regularity properties of U. 



A Some elementary results 



Here are two results used in the proofs of Theorems 2.1 and 2.2. 



Proposition A.l. Let (Y n ) n be a sequence of random vectors ofM. d converging almost surely 
to a non random vector x E R d . For any fixed integer n > and positive definite matrix 



Q G 



pdxd 



, we 



Jq°° e Q u du x almost surely. 



define, for all integers t>0, Z t = ELo e Q ^ k Sn ^ j n+k Y n+k . Then, lim t Z t 



Proof. It is clear that limt_ s . 00 Jo™'* e ® u du x = e ® u du x. Hence, it is sufficient to 
consider 



Z t - I " e~ Qu dux 
o 



< ^ n + k 

k=0 



s O(»n,fe-*n,t) 



Y n +k 



+ 



t 



fc=0 



Q{s n ,k~ Sn,t) _ 



- Qu du 



X . 



(19) 



Let q > (resp. q > 0) be the smallest (resp. greatest) eigenvalue of Q. 

Step 1 : We will prove that the first term in Equation (19) tends to almost surely. 



fc=0 



3 Q(Sn,fc-Sn,t) 



\Y, 



n+k 



s t r 

— J s n,k — 1 



,g( s n,k—Sn,t) \V 

v— J. y 



n+k 



x\ du 



k=0 

< / 

Jo 



£(«-*«,*) e£7„+. nW \Y n+Tn{u) - x\ du (20) 



where for any real number u > t n (u) is the largest integer k such that s n ^-i < u < s nj fc. 
Note that lim^+oo t n (u) = +oo. lim u ^ +oc e^ 7 "+ T «(") \Y n+Tn ^—x\ = a.s., hence it is obvious 
that the term on the r.h.s of Equation (20) tend to almost surely. 

Step 2 : We will now prove that the second term in Equation (19) tends to 0. 
We use the convention s n -\ = and recall that s ntk = s n ^-i + 7n+fc- Note that 



J. Lelong 



13 



jsn.t e q u ^ u _ jsn.t e Q( u s n , t ) ^ h ence the following inequality holds 



^2"fn+ke Q{Sn ' k ~ Sn ' t] - 



k=0 



<E 

k=0 JSn - k ~ 1 

I 



e Q(Sn,k-Sn,t) _ e Q{u-S n ,t) 



du 



<E 

I 



a Q(«— S n ,i) 



3 Q(s„,jfc-u) _J 



du. 



fc=0 S "' fe - 1 

Let e > 0, there exits Ti > such that for all t > Ti, (e«T«+* -1) < e, hence for all i > Ti, 



g(«-s„,t) / 57, 



(e«T«+* -l)du < 2 T"'" e2 (tt_8n ' t) (e«-l)d« + e ]T e^ -5 "^ du, 

</ e g^Sn,t)^ e q_ 1 ^ du + £ du, 

JO Js n .Ti 



1 \ e q — 1 1 

q q' 



There exists T 2 > Ti such that for all t > T 2 , ( e £ (s "' T i~ s ™< t) - e"^ n ' t )(e <? -1) < e, hence for all 
t >T 2 , 



E 



2 c 



fe=0 ^.fc- 1 



This ends to prove that the second term in Equation (19) tends to when t goes to infinity. ■ 

Proposition A. 2. The conclusion of Proposition A. I still holds if we assume that the se- 
quence (Y n ) n is uniformly integrable and if it converges in probability to a non random vector 
x e R n . 

Proof. We recall the decomposition given by Equation (19) 



Z t - I e~ Qu du x 



t 

^ E 

k=0 

t 

+ " 



Y n -\-k %\ 



E^+* 

k=0 



Q{Sn,kS n ,t) _ I ' e ~Q U du 





X . 



The last term in the above equation has already been proved to tend to in the proof of Propo- 



sition A.l Step 2. So, we only need to prove that lim u ^ +oc ,Ylk=o^n+k e^ Sn ' fe Sn '^ 
x\ = in probability. 

Let q > (resp. q > 0) be the smallest (resp. greatest) eigenvalue of Q. 
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k=0 



IK 



+k ~ x \ 



<E 



S„,k 



3ff(««,fc-«»,t) |y n+fc _ x | dn 



fc = J Sn,k-l 



< 



where for any real number u > 0, is the largest integer k such that s n fc_i < u < s n fc. 

Let Y& = 7fc|Kc — x\. The sequence (Kfe)fe tends to zero in probability, is uniformly integrable 
and positive. 



E 



or* e£(u " sn ' t) yn+r " (u) du ) = sr e ~ u ~ sn ' t] ^n+^du. (2i) 



Since (Kfc)& is uniformly integrable and converges to in probability, lim^+oo K(Y n+Tn ^) = 
0, hence the term on the r.h.s of Equation (21) tends to when t goes to infinity. This proves 
that lim n _> +00 J2k=o7n+k e Q ( Sn ' k ~ Sn -^ |K+fc — x| = in L 1 and in probability. ■ 
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